Linking Linguistic Resources: Examples from the Open Linguistics Working Group

نویسندگان

  • Christian Chiarcos
  • Sebastian Hellmann
  • Sebastian Nordhoff
چکیده

The contributions of this part have described recent activities of the OWLG as a whole and of individual OWLG members aiming to provide linguistic resources as Linked Data. Here, we describe how linguistic resources can be linked with each other, and we illustrate possible use cases of information integration from various sources with example queries for the major types of linguistic resources: Using DBpedia (Hellmann et al, this vol.) to represent lexical-semantic resource, the German NEGRA corpus in its POWLA representation (Chiarcos, this vol.) to represent linguistic corpora, the OLiA ontologies to represent repositories of linguistic terminology, and languoid definitions in Glottolog/Langdoc (Nordhoff, this vol.) to represent linguistic knowledge bases and metadata repositories. We use data from German for illustration purposes, the NEGRA corpus, a linguistically annotated collection of newspaper articles from the Frankfurter Rundschau. The architecture described here, is, however, not specific to German, but can also be applied to other languages. In fact, the Glottolog/Langdoc resources have been developed primarily for language documentation and typological studies, but their applications as described below naturally extends for less-resourced languages. Christian Chiarcos Information Sciences Institute, University of Southern California, 4676 Admiralty Way # 1001, Marina del Rey, CA 90292 e-mail: [email protected] Sebastian Hellmann Universität Leipzig, Fakultät für Mathematik und Informatik, Abt. Betriebliche Informationssysteme, Johannisgasse 26, 04103 Leipzig, Germany e-mail: hellmann@informatik. uni-leipzig.de Sebastian Nordhoff Department of Linguistics, Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, 04103 Leipzig, Germany e-mail: sebastian [email protected]

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud

The Open Linguistics Working Group (OWLG) brings together researchers from various fields of linguistics, natural language processing, and information technology to present and discuss principles, case studies, and best practices for representing, publishing and linking linguistic data collections. A major outcome of our work is the Linguistic Linked Open Data (LLOD) cloud, an LOD (sub-)cloud o...

متن کامل

Towards a Linguistic Linked Open Data cloud: The Open Linguistics Working Group

The Open Linguistics Working Group (OWLG) is an initiative of experts from different fields concerned with linguistic data, including academic linguistics (e.g. typology, corpus linguistics), applied linguistics (e.g. computational linguistics, lexicography and language documentation), and NLP (e.g. from the Semantic Web community). The primary goals of the working group are 1) promoting the id...

متن کامل

The Open Linguistics Working Group of the Open Knowledge Foundation

The Open Linguistics Working Group (OWLG) is an initiative of experts from different fields concerned with linguistic data, including academic linguistics (e.g. typology, corpus linguistics), applied linguistics (e.g. computational linguistics, lexicography and language documentation) and NLP (e.g. from the Semantic Web community). The primary goals of the working group are 1) the promotion of ...

متن کامل

The Open Linguistics Working Group

This paper describes the Open Linguistics Working Group (OWLG) of the Open Knowledge Foundation (OKFN). The OWLG is an initiative concerned with linguistic data by scholars from diverse fields, including linguistics, NLP, and information science. The primary goal of the working group is to promote the idea of open linguistic resources, to develop means for their representation and to encourage ...

متن کامل

Publishing and Linking WordNet using lemon and RDF

In this paper we provide a description of a dataset consisting of data from the Princeton WordNet. This version is intended to provide canonical URIs that can be used by a wide variety of lexical resources to express their linking as part of the Linguistic Linked Open Data Cloud. Furthermore, this is the first version to use the lemon model and we describe how we represent WordNet with this model.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012